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ABSTRACT 

In  a  cellular  wireless  system,  users  located  at  cell  edges  often  suffer  significant  out-of-cell  interference.  Assuming  each 
base  station  is  equipped  with  multiple  antennas,  we  can  model  this  scenario  as  a  multiple-input  single-output  (MISO) 
interference  channel.  In  this  paper  we  consider  a  coordinated  beamforming  approach  whereby  multiple  base  stations 
jointly  optimize  their  downlink  beamforming  vectors  in  order  to  simultaneously  improve  the  data  rates  of  a  given  group 
of  cell  edge  users.  Assuming  perfect  channel  knowledge,  we  formulate  this  problem  as  the  maximization  of  a  system 
utility  (which  balances  user  fairness  and  average  user  rates),  subject  to  individual  power  constraints  at  each  base 
station.  We  show  that,  for  the  single  carrier  case  and  when  the  number  of  antennas  at  each  base  station  is  at  least  two, 
the  optimal  coordinated  beamforming  problem  is  NP-hard  for  both  the  harmonic  mean  utility  and  the  proportional 
fairness  utility.  For  general  utilities,  we  propose  a  cyclic  coordinate  descent  algorithm,  which  enables  each  transmitter 
to  update  its  beamformer  locally  with  limited  information  exchange,  and  establish  its  global  convergence  to  a  stationary 
point.  We  illustrate  its  effectiveness  in  computer  simulations  by  using  the  space  matched  beamformer  as  a  benchmark. 
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Coordinated  Beamforming  for  MISO  Interference 
Channel:  Complexity  Analysis  and  Efficient 

Algorithms* 

Ya-Feng  Li  id,  Yu-Hong  Dai' ,  and  Zhi-Quan  Luo' 


Abstract — In  a  cellular  wireless  system,  users  located  at  cell 
edges  often  suffer  significant  out-of-cell  interference.  Assuming 
each  base  station  is  equipped  with  multiple  antennas,  we  can 
model  this  scenario  as  a  multiple-input  single-output  (MISO) 
interference  channel.  In  this  paper  we  consider  a  coordinated 
beamforming  approach  whereby  multiple  base  stations  jointly 
optimize  their  downlink  beamforming  vectors  in  order  to  simul¬ 
taneously  improve  the  data  rates  of  a  given  group  of  cell  edge 
users.  Assuming  perfect  channel  knowledge,  we  formulate  this 
problem  as  the  maximization  of  a  system  utility  (which  balances 
user  fairness  and  average  user  rates),  subject  to  individual  power 
constraints  at  each  base  station.  We  show  that,  for  the  single 
carrier  case  and  when  the  number  of  antennas  at  each  base 
station  is  at  least  two,  the  optimal  coordinated  beamforming 
problem  is  NP-hard  for  both  the  harmonic  mean  utility  and  the 
proportional  fairness  utility.  For  general  utilities,  we  propose  a 
cyclic  coordinate  descent  algorithm,  which  enables  each  trans¬ 
mitter  to  update  its  beamformer  locally  with  limited  information 
exchange,  and  establish  its  global  convergence  to  a  stationary 
point.  We  illustrate  its  effectiveness  in  computer  simulations  by 
using  the  space  matched  beamformer  as  a  benchmark. 

Index  Terms — MISO  interference  channel,  coordinated  beam¬ 
forming,  complexity,  cyclic  coordinate  descent  algorithm,  global 
convergence. 


I.  Introduction 

In  a  conventional  wireless  cellular  system,  base  stations 
from  different  cells  communicate  with  their  respective  remote 
terminals  independently.  Signal  processing  is  performed  on  an 
intra-cell  basis,  while  the  out-of-cell  interference  is  treated  as 
background  noise.  This  architecture  often  causes  undesirable 
service  outages  to  users  situated  near  cell  edges  where  the  out- 
of-cell  interference  can  be  severe.  Since  the  conventional  intra¬ 
cell  signal  processing  can  not  effectively  mitigate  the  impact 
of  inter-cell  interference,  we  are  led  to  consider  coordinated 
base  station  beamforming  across  multiple  cells  in  order  to 
improve  the  services  to  edge  users.  In  this  paper,  we  focus  on 
the  downlink  scenario  where  the  base  stations  are  equipped 
with  multiple  antennas  and  model  it  as  a  MISO  interference 
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channel.  We  consider  joint  optimal  beamforming  across  multi¬ 
ple  base  stations  to  simultaneously  improve  the  data  rates  of  a 
given  group  of  cell  edge  users.  Assuming  that  the  channel  state 
information  (CSI)  is  known,  we  formulate  this  problem  as  the 
maximization  of  a  system  utility  (which  balances  user  fairness 
and  average  user  rates),  subject  to  individual  power  constraints 
at  each  base  station.  We  show  that,  for  the  single  carrier  case 
and  when  the  number  of  antennas  at  each  base  station  is  at  least 
two,  the  optimal  coordinated  beamforming  problem  is  NP- 
hard  for  both  the  harmonic  mean  utility  and  the  proportional 
fairness  utility.  This  NP-hardness  result  is  in  contrast  to  the 
single  antenna  case  for  which  the  same  optimization  problem 
is  convex  for  both  the  harmonic  mean  and  proportional  fairness 
utility  functions  [1],  For  the  min-rate  utility,  this  problem  is 
known  to  be  also  solvable  in  polynomial  time  [1],  [2], 

In  addition  to  the  complexity  analysis,  we  propose  a  prac¬ 
tical  iterative  cyclic  coordinate  descent  algorithm  for  the 
multi-cell  coordinated  beamforming  problem  by  exploiting 
the  separability  of  power  constraints.  We  prove  the  global 
convergence  of  this  cyclic  coordinate  descent  algorithm  (to 
a  stationary  point).  Numerical  experiments  are  also  presented 
to  illustrate  the  effectiveness  of  the  proposed  algorithm. 

A.  Related  Work 

Downlink  beamforming  has  been  studied  extensively  in  the 
single  cell  setup  [3],  [4],  For  the  multi-cell  interference  chan¬ 
nel,  the  reference  [5]  considered  coordinated  beamforming  for 
the  minimization  of  total  weighted  transmitted  power  across 
the  base  stations  subject  to  individual  signal-to-interference- 
plus-noise-ratio  (SINR)  constraints  at  the  remote  users.  It  turns 
out  this  problem  can  be  transformed  into  a  convex  second 
order  conic  programming  (SOCP)  and  efficiently  solved.  How¬ 
ever,  the  maximization  of  weighted  sum  rates  for  a  multi-cell 
interference  channel  under  individual  power  constraints  is  NP- 
hard  even  for  the  single  antenna  and  the  single  carrier  case  [1], 
In  fact,  more  is  known  about  the  single  antenna  interference 
channel  case.  For  instance,  if  the  system  utility  is  changed  into 
either  the  geometric  mean  rates  (i.e.,  proportional  fairness), 
the  harmonic  mean  rates,  or  the  min-rate,  the  corresponding 
utility  maximization  problem  (for  the  single  tone  case)  can 
be  converted  to  a  convex  optimization  problem  and  solved 
efficiently  to  global  optimality  [1],  [6].  However,  when  the 
number  of  tones  is  more  than  two,  then  all  of  the  aforemen¬ 
tioned  power  control  problems  are  NP-hard.  The  focus  of  this 
paper  is  to  study  the  multi-antenna  case  (MISO  interference 
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channel),  analyze  the  complexity  of  the  corresponding  utility 
maximization  problems,  and  propose  a  practical  algorithm  to 
solve  them. 

In  addition  to  the  aforementioned  utility  based  formulations, 
various  base  station  cooperation  techniques  have  been  pro¬ 
posed  to  mitigate  inter-cell  interferences,  including  multi-point 
coordinated  transmission,  or  network  multi-input  multi-output 
(MIMO)  transmission  [7-16],  For  example,  the  distributed 
or  decentralized  approaches  are  proposed  for  coordinated 
transmitter  beamforming  vectors  in  MISO  interference  channel 
in  [7],  [10],  [13],  [14],  [16],  [17],  some  of  which  are  based 
on  dual  uplink  channels.  In  particular,  a  distributed  pricing 
algorithm  for  power  control  and  beamformer  design  in  the 
MISO  interference  networks  is  proposed  in  [17],  At  each 
iteration,  transmitters  try  to  maximize  their  utility  minus  the 
total  interference  cost,  i.e.,  the  summation  of  the  interference 
price  times  the  received  interference,  and  the  interference  price 
indicates  the  marginal  decrease  in  the  corresponding  user’s 
utility  due  to  a  marginal  increase  in  interference  associated 
with  the  particular  receiver. 

The  references  [11],  [12]  show  that  coordination  enables 
the  cellular  network  to  enjoy  a  greater  spectral  efficiency. 
Most  of  these  cooperative  techniques  require  each  base  station 
to  have  not  only  full/partial  CSI  but  also  the  knowledge  of 
actual  independent  data  streams  to  all  remote  terminals.  With 
the  complete  sharing  of  data  streams  and  CSI,  the  multi-cell 
scenario  is  effectively  reduced  to  a  single  cell  interference 
management  problem  with  either  total  [18]  or  per-group-of- 
antenna  power  constraints  [19],  [20],  Among  the  major  draw¬ 
backs  of  these  techniques  (in  comparison  to  the  utility  based 
approaches)  are  their  stringent  requirement  on  base  station  co¬ 
ordination,  the  large  demand  on  the  communication  bandwidth 
of  backhaul  links,  as  well  as  the  heavy  computational  load 
associated  with  the  increasing  number  of  cells  [21],  [22].  The 
references  [8],  [15],  [23],  [24]  provided  characterizations  of 
the  achievable  rate  region  and  proved  the  existence  of  a  unique 
Nash  equilibrium  which  is  inefficient  in  the  sense  that  the 
achievable  rates  are  bounded  by  a  constant,  regardless  of  the 
available  transmit  power.  See  [23],  [25]  for  the  recent  results 
of  the  MISO  channel. 

Notation:  The  notation  for  this  paper  is  as  follows:  Lower 
case  boldface  is  used  for  column  vectors.  ( • )7  and  (-)t  denote 
transpose  and  Hermitian  transpose,  (x,  y)  denotes  a  two- 
dimensional  row  vector.  ||  •  ||  denotes  the  Euclidean  norm.  As¬ 
suming  /(x)  is  a  multi-variable  function,  V/(x)  and  V2/(x) 
denote  its  gradient  and  Hessian,  respectively. 

II.  Problem  Formulation 

Consider  a  cellular  system  in  which  there  are  K  base 
stations  each  equipped  with  L  transmit  antennas.  The  K  base 
stations  wish  to  transmit  respectively  to  K  mobile  receivers 
each  having  only  a  single  antenna.  Each  base  station  can  direct 
a  beam  to  its  intended  receiver  in  such  a  way  that  the  resulting 
interference  to  the  other  mobile  units  is  small.  Consider  the 
single  carrier  case,  and  let  h jk  €  CL  denote  L-dimensional 
complex  channel  vector  between  base  station  j  and  receiver 
k.  Let  vk  £  CL  denote  the  beamforming  vector  used  by 


base  station  k,  while  a /,,  is  a  complex  scalar  denoting  the 
information  signal  for  user  k  with  E|sfc|2  =  1.  The  transmitter 
vector  of  the  j-th  base  station  is  v7  s:j .  Then  the  signal  received 
by  user  k  can  be  described  as 

K 

Z/fc  =  E  hjfe  (vj-Sj)  +  zk,  1  <  k  <  K,  (1) 

j= 1 

where  zk  is  the  additive  white  Gaussian  noise  (AWGN)  with 
variance  <t2/2  per  real  dimension.  Treating  interference  as 
noise,  we  can  write  the  SINR  of  each  user  as 


SINRfc 


lhLvfci2 

CTfc  +  E^fclhjfcvil2 


(2) 


Adopting  an  utility,  we  can  formulate  the  optimal  coordinated 
downlink  beamforming  problem  as 


max  H(r1,r2,...,rK) 
s.t.  rk  =  log  ( 1  + 


lhIfevfe|S 


*1  +  E 


iL,  Vi 


(3) 


j^k  \jk  3 


1 1  1 1 2  <  Pfc,  1  <  k  <  K, 


where  Pk  denotes  the  power  budget  of  base  station  fc,  and  H(-) 
denotes  the  system  utility  which  may  be  any  of  the  following 

K 


Weighted  sum-rate  utility:  H\  =  — wkfk,  with 
weight  Wk  >  0. 


fc= l 


•  Proportional  fairness  utility:  H2 
1  K 

^Elosrfc- 

k= 1 


1/K 


•  Harmonic  mean  utility:  //:!  = 

•  Min-rate  utility:  H4  =  min 

l<k<K 


K!  (E 

\k=  1 


According  to  [23],  [24],  problem  (3)  can  be  written  in  a  more 
general  form 

max  H(ri,r2,...,rK) 


s.t. 


rk  =  log 


h-kkV  kfakk 


(4) 


Trace(Vfc)  <  Pk,  Vk  *=  0,  1  <  k  <  K, 


where  is  the  transmit  covariance  matrix  at  transmitter  k. 
The  results  in  [23],  [24]  state  that  problem  (4)  has  a  rank-one 
optimal  solution  for  each  V k.  This  implies  that  problem  (3) 
and  problem  (4)  are  equivalent.  We  focus  on  formulation  (3) 
in  this  paper. 

The  above  beamforming  problem  (3)  can  be  nonconvex 
in  general  due  to  the  nonlinear  equality  constraints.  Given 
nonnegative  weights  (wi,  w2,  ...,wjc),  the  optimal  tuple  of 
rates  (rq,  r2, Tk)  of  (3)  should  lie  on  the  boundary  of 
the  achievable  rate  region.  See  [8],  [9],  [15],  [23],  [24]  for 
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various  effort  to  characterize  the  achievable  rate  region  of  the 
interference  channel. 

In  practice,  the  choice  of  utilities  depends  on  a  suitable  com¬ 
promise  between  system  performance  (total  rates  achievable) 
and  user  fairness.  The  sum-rate  utility  Hi  focuses  entirely 
on  system  performance,  while  the  min-rate  utility  Hi  places 
the  highest  emphasis  on  user  fairness.  The  other  two  choices 
Il2  and  //•;  represent  an  appropriate  tradeoff  between  the  two 
extremes. 

III.  Complexity  Analysis 

In  this  section,  we  investigate  the  complexity  status  of  the 
optimal  coordinated  downlink  beamforming  problem  (3)  under 
various  choices  of  system  utilities.  We  provide  a  complete 
analysis  on  when  the  problem  is  NP-hard  and  also  identify 
subclasses  of  the  problem  that  are  solvable  in  polynomial  time. 

A.  Computational  complexity  theory:  a  brief  background 

Genetically,  an  optimization  problem  can  be  described  by 
the  minimization  of  an  objective  function  over  a  feasible 
region.  A  decision  version  of  the  minimization  problem  is 
to  decide  if  the  feasible  region  contains  a  vector  at  which 
the  objective  function  value  is  below  a  given  threshold.  The 
answer  to  the  decision  problem  is  binary,  true  or  false,  and 
there  is  no  need  to  identify  what  the  solution  is.  The  decision 
version  is  typically  easier  to  solve  than  the  original  optimiza¬ 
tion  problem  which  requires  the  determination  of  an  (globally) 
optimal  solution.  The  size  of  an  optimization  problem  instance 
is  defined  as  the  minimum  length  of  a  binary  string  required  to 
describe  the  objective  function  and  the  feasible  region.  We  say 
an  algorithm  solves  the  decision  version  of  an  optimization 
problem  if  for  each  instance  of  the  problem,  the  algorithm 
correctly  gives  “true”  or  “false”  answer.  We  can  define  the 
running  time  of  an  algorithm  as  the  maximum  number  of  basic 
computational  steps  (e.g.,  number  of  arithmetic  operations) 
required  to  solve  the  decision  version  of  an  optimization 
problem  of  a  given  size.  Typically,  the  algorithm’s  running 
time  is  a  function  of  the  problem  size. 

In  the  computational  complexity  theory  [26],  [30],  there  are 
two  important  classes  of  optimization  problems,  P  and  NP.  The 
class  P  contains  optimization  problems  which  are  solvable  (or 
decidable)  by  an  algorithm  whose  running  time  grows  at  most 
as  a  polynomial  function  of  the  input  size.  It  turns  out  that  the 
class  P  is  rather  robust  to  the  actual  definition  of  input  size. 
For  example,  P  is  invariant  if  we  alternatively  define  the  input 
size  of  an  optimization  problem  as  the  sum  of  the  problem 
dimension,  the  number  of  constraints  and  the  binary  length 
of  the  input  data  and  threshold  value.  We  say  an  algorithm 
is  a  pseudo  polynomial  time  algorithm  if  its  mnning  time 
is  a  polynomial  when  we  bound  the  size  of  numbers  in  the 
input  by  a  constant.  For  example,  many  dynamic  programming 
algorithms  (e.g.,  Viterbi  algorithm)  are  pseudo  polynomial 
time  since  their  state  space  are  typically  exponential  if  the 
numbers  in  the  input  are  not  bounded. 

The  class  NP,  which  stands  for  Nondeterministic  Polynomial 
time,  consists  of  decision  version  of  optimization  problems 
whose  “true”  instances  can  be  verified  in  polynomial  time. 


assuming  the  availability  of  a  feasible  solution  that  meets 
the  threshold  requirement.  More  formally,  we  say  a  nondeter¬ 
ministic  algorithm  solves  a  decision  version  of  optimization 
problem  if  we  can  verify  each  “true”  instance  of  the  problem 
using  a  sequence  of  nondeterministic  steps  (i.e.,  involving 
random  guesses).  If  the  number  of  nondeterministic  steps  is 
polynomial,  then  the  algorithm  is  said  to  have  a  nondetermin¬ 
istic  polynomial  running  time.  For  example,  for  any  symmetric 
matrix  Q  €  R"x"  and  a  threshold  value  L,  consider  the  prob¬ 
lem  of  deciding  if  there  exists  a  binary  vector  x  £  {—1,1}" 
such  that  xtQx  <  L.  A  nondeterministic  algorithm  to  solve 
this  problem  is  to  guess  a  binary  vector  x  and  then  check 
if  xtQx  <  L  indeed  holds.  Such  a  binary  vector  x  exists 
for  all  “true”  instances  of  the  problem,  and  the  verification 
process  requires  polynomially  many  steps  although  some  steps 
may  involve  a  random  guess  (of  a  component  of  x).  In  this 
case,  the  problem  is  solvable  in  nondeterministic  polynomial 
time.  The  class  NP  contains  precisely  those  decision  version 
of  optimization  problems  that  are  solvable  in  nondeterministic 
polynomial  time.  Clearly,  P  is  contained  in  NP.  It  is  widely 
conjectured  that  P  ^  NP,  or  equivalently,  there  are  problems  in 
NP  which  are  not  solvable  in  (deterministic)  polynomial  time. 

A  subset  of  problems  in  NP  are  called  NP -complete.  NP- 
complete  problems  are  those  in  NP  which  are  most  difficult 
to  solve,  in  the  sense  that  if  any  one  of  them  is  in  P,  so  is 
every  other  problem  in  NP.  There  are  many  well  known  NP- 
complete  problems  such  as  the  traveling  salesman  problem, 
the  3-colorability  problem,  and  so  on.  The  latter  problem  is 
to  decide  if  the  nodes  of  a  given  graph  can  be  colored  in 
three  colors  so  that  each  adjacent  pair  of  nodes  are  colored 
differently.  The  3-colorability  problem  is  clearly  in  NP  since 
we  can  easily  check  if  a  guessed  coloring  scheme  meets  the 
requirement.  There  is  no  known  polynomial  time  algorithm 
to  solve  the  3-colorability  problem.  In  fact,  if  this  problem  is 
solvable  in  polynomial  time  (i.e.,  in  P),  then  every  problem  in 
NP  is  solvable  in  polynomial  time,  or  equivalently  P  =  NP.  A 
problem  V  is  said  to  be  NP-hard  if  it  is  at  least  as  hard  as  those 
NP-complete  problems,  which  means  that  the  polynomial  time 
solvability  of  V  would  imply  every  NP-complete  problem  is 
in  P.  A  NP-hard  problem  may  not  be  in  NP.  For  example,  the 
binary  quadratic  minimization  problem  minx6  r  x  xTQx 
is  NP-hard,  since  it  is  not  known  to  be  in  NP,  and  is  at  least  as 
hard  as  the  NP-complete  problem  of  deciding  if  there  exists 
a  binary  vector  x  such  that  x7  Qx  <  L,  where  the  threshold 
value  L  is  given.  A  problem  is  called  strongly  NP-hard  if 
it  cannot  be  solved  by  a  pseudo  polynomial  time  algorithm 
unless  P  =  NP. 

To  prove  a  problem  V  is  NP-complete,  we  need  to  show 
two  things.  First,  we  verify  the  problem  is  in  NP.  This  step  is 
usually  easy.  Second,  we  need  to  show  V  is  at  least  as  difficult 
as  a  known  NP-complete  problem.  This  can  be  accomplished 
by  a  standard  technique  called  polynomial  time  transformation. 
In  a  polynomial  time  transformation,  we  pick  a  known  NP- 
complete  problem  and  show  that  it  is  equivalent  to  a  special 
case  of  V .  More  precisely,  we  take  an  arbitrary  instance  of 
a  known  NP-complete  problem,  construct  a  special  instance 
(with  polynomial  size)  of  V,  and  then  establish  the  equivalence 
of  the  two  instances.  To  show  a  problem  is  NP-hard,  we  simply 
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ignore  step  1,  as  there  is  no  need  to  show  V  is  in  NP.  To  show 
V  is  strongly  NP-hard,  we  need  to  additionally  ensure  the 
special  instance  of  V  we  construct  involves  only  numbers  of 
constant  size,  i.e.,  numbers  whose  combined  bit  length  does 
not  increase  with  the  problem  dimension  or  the  number  of 
constraints. 


B.  Maximization  of  the  Weighted  Sum-Rate  Utility 


K 


Consider  the  system  utility  H\  =  —  Wk'i'k-  In  the  single 

K 


fc= 1 

antenna  case  ( L  =  1),  the  original  system  optimization  prob¬ 
lem  (3)  becomes  the  following  (5),  where  xk  =  ||  || 2 ,  ajk  = 
llhjfe ||2/||hfcfc ||2  and  yk  =  crj?/||hfcfc||2-  Problem  (5)  is  known 
to  be  NP-hard  [1]  even  when  the  weights  wk  are  all  equal,  and 
the  proof  is  based  on  a  polynomial  time  transformation  from 
the  maximum  independent  set  problem  (which  is  known  to 
be  NP-complete).  Thus,  the  general  case  of  L  >  1  is  also 
NP-hard.  For  various  special  MISO  channels,  the  sum-rate 
maximization  problem  can  still  be  solved  in  polynomial  time, 
see  [25],  [27-29], 


C.  Maximization  of  the  Harmonic  Mean  Utility 

We  now  study  the  complexity  status  of  the  optimal  coor¬ 
dinated  downlink  beamforming  problem  (3)  defined  by  the 
harmonic  mean  rate  utility. 

Theorem  3.1  (Harmonic  Mean  Utility):  For  the  harmonic 

/  K  _  \ 

mean  utility  //:i  =  K/  I  rk  1  ) ,  the  optimal  coordinated 

Vfc=i  / 

downlink  beamforming  problem  can  be  transformed  into  a 
convex  optimization  problem  when  L  =  1,  but  is  NP-hard 
when  L  >  2. 

When  there  is  only  one  transmit  antenna  ( L  =  1),  the 
reference  [1]  shows  that  the  harmonic  mean  rate  maximization 
can  be  transformed  into  an  equivalent  convex  problem.  We 
thus  focus  on  the  case  L  >  2.  Notice  that  the  harmonic  mean 
utility  maximization  problem  is  a  continuous  optimization 
problem.  To  show  its  NP-hardness,  we  need  to  transform 
a  known  NP-hard  discrete  problem  to  the  harmonic  mean 
maximization  problem.  To  facilitate  this  transformation,  it  is 
necessary  to  induce  certain  discrete  structure  to  the  optimal 
solutions  to  the  harmonic  mean  maximization  problem.  This 
is  accomplished  by  using  the  concavity  of  the  harmonic 
mean  utility  with  respect  to  each  beamforming  vector  vk.  In 
particular.  Lemma  3.2  (proved  in  Appendix  I)  shows  that  we 
can  constrain  the  optimal  beamforming  vectors  to  be  taken 
from  two  orthogonal  vectors  h0  or  h/,. 

The  NP-hardness  proof  of  Theorem  3.1  is  based  on  a 
transformation  from  a  variant  of  the  3-SAT  [30]  problem. 
To  describe  this  variant,  we  need  to  define  the  UNANIMITY 
property  and  the  NAE  (stands  for  “not- all -equal”)  property  of 
a  disjunctive  clause1. 

Definition  3.1 :  For  a  given  truth  assignment  to  a  set  of 
Boolean  variables,  a  disjunctive  clause  is  said  to  be  UNANI¬ 
MOUS  if  all  literals  in  the  clause  have  the  same  value  (whether 

'Recall  that  for  a  given  set  of  Boolean  variables,  a  literal  is  defined  as 
either  a  Boolean  variable  or  its  negation,  while  a  disjunctive  clause  refers  to 
a  logical  expression  consisting  of  the  logical  “OR"  of  literals. 


it  is  the  True  or  the  False  value).  Otherwise  it  is  said  to  be 
satisfied  in  the  NAE  (Not-All-Equal)  sense. 

Notice  that  a  disjunctive  clause  must  be  satisfied  if  it  is  to  have 
the  NAE  property.  We  now  define  some  decision  problems 
over  Boolean  variables. 

Definition  3.2:  MAX-UNANIMITY  problem :  given  a  pos¬ 
itive  integer  M  and  m  disjunctive  clauses  defined  over  n 
Boolean  variables,  we  ask  whether  there  exists  a  truth  assign¬ 
ment  such  that  the  number  of  unanimous  disjunctive  clauses  is 
at  least  M.  When  the  number  of  literals  in  each  clause  is  two, 
we  denote  the  corresponding  problem  as  MAX-2UNANIMITY 
problem.  When  each  clause  contains  three  literals,  the  problem 
of  determining  whether  there  exists  a  truth  assignment  under 
which  at  least  M  clauses  are  satisfied  in  the  NAE  sense  is 
called  the  NAE-SAT  problem. 

The  NAE-SAT  problem  is  known  to  be  NP-complete  [30]. 
Our  next  lemma  says  that  the  MAX-2UNANIMITY  problem 
is  also  NP-complete. 

Lemma  3.1:  The  MAX-2UNANIMITY  problem  is  NP- 
complete. 

Proof:  We  construct  a  polynomial  time  transformation 
from  the  NAE-SAT  problem.  It  can  be  checked  that  the  MAX- 
2UNANIMITY  problem  is  in  the  class  NP.  Given  a  disjunctive 
clause  with  three  literals,  c  =  x  V  y  V  z,  let  us  construct  the 
following  six  clauses,  each  involving  only  two  literals: 

R(c)  :  x  V  y,  x  V  z,  yVx,  yVz ,  z  V  x,  z\/  y.  (6) 

It  can  be  checked  that  R(c )  has  the  following  properties: 

1)  The  number  of  unanimous  clauses  (i.e.,  all  literals  hav¬ 
ing  the  same  value)  in  R(c)  is  at  most  four. 

2)  The  clause  c  is  satisfied  in  the  NAE  sense  if  and  only 
if  the  number  of  unanimous  clauses  in  f?(c)  is  four. 

Now  given  any  instance  <j>  of  NAE-SAT  problem,  we  con¬ 
struct  a  corresponding  instance  R(<jf)  of  MAX-2UNANIMITY 
problem  as  follows:  for  each  clause  c  =  a  V  /?  V  7  of  <p,  we 
add  to  R((f>)  the  six  clauses  in  (6),  with  x,  y ,  z  replaced 
with  the  literals  a,  /3,  7  respectively.  In  this  way,  if  f  has 
to  clauses,  then  R(<j>)  will  have  6 to  clauses.  Let  M  =  4 to. 
Then  properties  1  and  2  imply  that  all  clauses  in  are 
simultaneously  satisfiable  in  the  NAE  sense  if  and  only  if  at 
least  M  =  4to  clauses  in  R(<j>)  can  be  made  unanimous.  In 
particular,  suppose  that  4to  clauses  in  Rid)  are  unanimous 
under  a  given  truth  assignment.  Since,  by  property  1,  each 
group  R(c)  of  six  clauses  can  have  at  most  four  unanimous 
clauses,  it  follows  that  exactly  four  clauses  must  be  unanimous 
in  each  group.  By  property  2,  this  further  implies  that  each 
clause  in  <f>  is  satisfied  in  NAE  sense.  Conversely,  any  truth 
assignment  that  satisfies  a  clause  c  in  the  NAE  sense  will  give 
rise  to  four  unanimous  clauses.  Thus,  if  all  to  clauses  in  <p  are 
satisfied  in  the  NAE  sense,  then  there  will  be  4to  unanimous 
clauses  in  R(<j>).  Finally,  this  transformation  is  in  polynomial 
time.  ■ 

An  immediate  corollary  of  Lemma  3.1  is  that  the  MAX- 
UNANIMITY  problem  is  NP-complete.  However,  if  we  ask 
whether  all  of  the  m  clauses  (i.e.,  M  =  to)  can  be  made 
unanimous,  then  the  corresponding  MAX-UNANIMITY  prob¬ 
lem,  simply  called  UNANIMITY  problem,  can  be  solved  in 
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max 


K  ( 

El  Xfc 

wkrk ,  subject  to  rk  =  log  1  H - 

fc=i  V  7fc  +  ^ 


j^k  ajkXj 


0<xk<  Pk,  1  <  k  <  K, 


(5) 


polynomial  time  (using  a  tree  search  technique).  Also,  notice 
that  a  disjunctive  clause  is  unanimous  under  a  given  truth 
assignment  if  and  only  if  c  is  not  satisfied  in  NAE  sense.  This 
implies  that  MIN-3UNANIMITY  problem  is  NP-complete. 
Since  these  results  are  not  needed  in  the  subsequent  analysis, 
we  state  them  without  proof. 

Proposition  3.1:  The  following  is  true: 

1)  MAX-UNANIMITY  problem  is  NP-complete. 

2)  MIN-3UNANIMITY  problem  is  NP-complete. 

3)  UNANIMITY  problem  is  solvable  in  polynomial  time. 

We  are  now  ready  to  prove  Theorem  3.1.  Proof:  Let 

the  utility  in  (3)  be  given  by  the  harmonic  mean  function  f/:!. 
Consider  an  instance  of  MAX-2UNANIMITY  problem  <i>  with 
clauses  Ci,C2,...,cm  defined  over  the  variables  x\,  x^,  xn 
and  an  integer  M.  Let  ha  =  (1,0)T,  =  (0,1)T  and 

h  =  (y/N,  0)T,  where  N  is  a  large  positive  number  (to  be 
specified  later).  We  write  each  clause  Cj  =  aj\/f3j,  with  aij,  f3j 
taken  from  {ati ,  X2,  ...,  xn,  xi,  X2,  ■  xn}.  Let  us  define  two 
mappings 


channel  vectors  are  h  =  ( \/7v,  0)7  and  their  noise  powers  are 
1.  Let  V4 i,i  =  1,2 n,  denote  the  transmit  beamforming 
vector  of  the  “variable  user”  4 i.  For  example,  the  clause 
ci  =  X2  V  5:3  corresponds  to  j  =  1  and  is  associated  with  two 
“clause  users”  which  are  denoted  by  4 n  +1,  4 n  +  2.  Since 
7r (1)  =  2  and  r(l)  =  —3,  the  two  users  4 n  +  1  and  4n  +  2 
experience  interferences  from  “variable  users”  4|-7r(l)|  =  8 
and  4|t(1)|  =  12.  The  corresponding  interference  terms  for 
these  two  “clause  users”  4n+l,  4n  +  2  are  |hj,V8|2  +  |h^Vi2|2 
and  Ih^vgl2  +  |h^v12|2,  respectively. 

The  correspondence  between  MAX-2UNANIMITY  prob¬ 
lem  and  the  optimal  coordinated  downlink  beamforming  prob¬ 
lem  (7)  is  listed  in  Table  I.  Notice  that  r4n+2j  can  be  obtained 
from  clause  Cj  according  to  Table  I  and  r4n+2j-i  can  be 
obtained  from  r4„+2j  by  swapping  ha  with  h^. 

TABLE  I 

Variable  Correspondence 


t r,  t  :  {1, 2, ...,  m}  i->  {±1,  ±2, ±n} 


such  that 


*U) 


i, 


if  Ql j  —  2/^ 
if  Otj  —  21^ 


and  r(j) 


i ,  if  Pj  =  Xi, 

-i,  if  (3j  —  Xi. 


For  instance,  if  c4  =  X3VX5.  then  we  have  a4  =  X3,  @4  =  X5, 
with  7t(4)  =  3  and  r(4)  =  —5.  For  i  =  ±1,±2, ....  ±n,  we 
define 


\  h a,  if  i  >  0, 
\  hb,  if  i  <  0. 


Given  an  instance  of  MAX-2UNANIMITY  problem,  we  con¬ 
struct  the  following  (7)  as  an  instance  of  (3)  (the  inverse 
of  harmonic  mean  utility  minimization  is  equivalent  to  the 
harmonic  mean  rate  utility  maximization)  with  a  total  of 
K  =  An  +  2 to  users.  Herein,  each  Boolean  variable  a:,  corre¬ 
sponds  to  four  users,  including  user  4*  (called  “variable  user”) 
and  user  4 *  —  1,4*  —  2  and  4*  —  3  (called  “auxiliary  variable 
users”);  while  each  clause  Cj  corresponds  to  a  pair  of  users, 
i.e.,  user  An  +  2 j  and  An  +  2 j  —  1  (called  “clause  users”). 
In  our  construction  (7),  each  (variable,  auxiliary  variable  or 
clause)  user  k  is  associated  with  a  transmitter  beamforming 
vector  Vfc,  k  =  1,  2, ...,  4?*  +  2 to. 

In  (7),  the  n  “variable  users”  communicate  interference 
free.  Their  channel  vectors  are  (a/0.9,  a/0.9)1  and  their 
noise  power  are  1.  The  3?*  “auxiliary  variable  users” 
4*  —  1, 4*  —  2, 4*  —  3,  *  =  1,  2, ...,  n,  do  suffer  from  crosstalk 
interference  from  the  “variable  user”  4*.  That  is,  the 
interference  channel  vectors  from  the  “variable  user”  4*  are 
(1, 1)T,  (1,  0)T,  (0, 1)T,  respectively;  the  direct  link  channel 
vectors  are  (1, 0)T,  (10, 0)T  and  (10,0)T;  the  self  noise 
power  at  each  “auxiliary  variable  user”  is  zero.  For  the 
“clause  users”  An  +  2 j  and  An  +  2j  —  1,  j  =  1, 2, ...,  m,  their 


MAX-2UNANIMITY 

Problem  (7) 

variable  Xi 
clause  Cj 

beamforming  vector  V4 i 
rates  r4n+2j  and  r4n+2j-i 
interference  term  |haV4i|2 

literal  Xi 

literal  Xi 

interference  term  |h£v4i|2 

We  first  fix  some  easy  variables  of  (7)  to  simplify  the 
problem.  Since  each  of  the  beamforming  vectors  v4?l+2j, 
v4n+2j— 1,  j  =  1,2,  ...,m,  and  v4i-u  *  =  1,2 ,...,n,l  = 
1,2,3,  appears  exactly  once  in  the  objective  function,  it 
follows  that,  by  optimality,  these  beamforming  vectors  must 
be  matched  to  the  corresponding  channel  vectors.  That  is, 

V*4n+2j  =  V4n+2i-l  =  (1,  0)T,  and  V*^  =  V*^  = 

v4i_3  =  (1,0)T.  Substituting  these  optimal  beamforming 
vectors  into  (7)  yields  (8).  It  only  remains  to  determine  the 
optimal  beamforming  vectors  v4i,  *  =  1,2,  ...,n.  For  this 
purpose,  we  need  the  following  key  lemma  whose  proof  is 
relegated  to  Appendix  I. 

Lemma  3.2:  If  N  >  2(e40m  —  1),  then  the  optimal  beam¬ 
forming  vectors  {vj,}  for  the  optimization  problem  (8)  must 
be  either  ha  =  (1, 0)T  or  h{,  =  (0, 1)T. 

It  follows  from  Lemma  3.2  that  the  optimal  beamforming 
vector  vjj  of  (8)  is  either  ha  or  Iq,.  In  either  case,  the  sum  of 
inverse  rates 

11  1  1  _  1  1  1 
r4i  r  4j_i  r4i-2  r4i-3  log  1.9  log  101  log  2 

=  C 

is  a  constant  C.  Thus,  regardless  of  whether  vj,-  =  ha  or 
h/,,  the  first  sum  in  the  objective  function  of  (8)  remains 
unchanged  and  equals  nC.  Thus,  we  only  need  to  consider 
the  second  sum  in  the  objective  function  of  (8).  Notice  that 
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min 


E 


1111 

— i - 1 - 1 - 

f4i  r4i-i  T4i-2  r4i_3 

s.t.  r4i  =  log  (l  +  |(\/ol9,  v/0^9)v4i;|2^ 
I(1,0)v44_1|2' 
l(l,l)v4l|2 

|(10,0)v4i-2 

|(1, 0)v4i|2 

,  M  ,  1(10, 0)v4i_3|2 

r<‘-3Jl08|1+  |(0, 1)v4,|2 


\r4n+2j  r4„+2i-l 


Ui- 1  =  log  ^1  + 
r4%-  2  =  log  ( 1  + 


r-4n+2j  =  log  ^1  + 
r4„+2j-i  =  log  (l  + 


_ |htV4n+2j|2 _ 

1  +  lht(j)V4k(l)||2  +  lhr(j)V4|'r(i)|  I 


|htV4n+2j-l 


!  +  lhEy)V4k(3)|l2  +  lht 


-r(4')V4lr(i)l  I 


||vfc||2  <1,  1  <  i  <n,  1  <  j  <  to,  1  <  k  <  4n  +  2m. 


(7) 


min 

s.t. 


1 

1"4i-3 


1 

t"4n+2j 


1 

t'4n+2j-l 


r4i  =  log  (1  +  0.9 1 (1,  l)v4j|2)  , 
r4i~i  =  log  (l  +  1/|(1,  l)v4i|2)  , 
r4i-2  =  log  (l  +  100/|(1, 0)v4j; |2)  , 
r4i-3  =  log  (1  +  100/1(0,  l)v4j;  j2)  , 

T4n+2J  =  log  (l  +  N/(l+  |hty)V4|w(i)||2  +  I h| (j) V4| t(j) 1 1 2)  )  , 
P4n+2j-l  =  log  (l  +  N/(l  +  |ht_7r(i)V4k(j)||2  +  |ht_r0-)V4|T(i)||2)), 


||v4i||2  <1,  1  <  i  <  n,  1  <  j  <  m. 


(8) 


1  1 

- 1 - 

T  An-\-2j  ^4n-\-2j—  1 


1 

log  (1 1  N/ 3) 
log  (1  +  N/2) 


- - — - — t —  ■  if  Cn  is  unanimous, 

log  (1  +  N/l)  3 

- ; - — .  if  c,  is  not  unanimous. 

log  (1  +  N/2)  3 


the  value  of  each  term  in  the  second  sum  only  depends  on 
whether  clause  Cj  is  unanimous  (see  the  second  equation  in 
the  next  page).  Since 

1  ,  1  ^  2 
log  (1  +  N/ 3)  +  log  (1  +  N)  <  log  (1  +  N/2) 

from  Claim  1  in  Appendix  I,  it  follows  that  second  sum 
of  (8)  will  be  smaller  if  more  clauses  are  satisfied  unani¬ 
mously.  Therefore,  the  minimum  of  (8)  is  only  related  to  the 
maximum  number  of  unanimous  clauses  in  the  given  MAX- 
2UNANIMITY  problem.  Specifically,  the  minimum  of  (8)  is 
no  more  than 

M  M  2(to  —  M) 

U  +  log  (1  +  N/ 3)  +  log  (1  +  N)  +  log  (1  +  N/2)  (  } 

if  and  only  if  there  exists  an  appropriate  truth  assignment  such 
that  at  least  M  clauses  are  made  unanimous  for  the  given 
MAX-2UNANIMITY  problem.  Thus,  we  have  transformed  the 
problem  of  MAX-2UNANIMITY  problem  to  the  problem  of 
checking  if  problem  (7)  will  have  an  optimal  value  below  the 
above  threshold  (9). 


Finally,  given  an  instance  of  the  MAX-2UNANIMITY 
problem,  we  can  construct  the  harmonic  mean  rate  maxi¬ 
mization  problem  (7)  in  polynomial  time.  Since  the  MAX- 
2UNANIMITY  problem  is  NP-complete  (Lemma  3.1),  it  fol¬ 
lows  that  the  optimal  coordinated  beamforming  problem  (3) 
with  harmonic  mean  utility  is  NP-hard.  ■ 

A  few  remarks  are  in  order.  First,  it  follows  from  the  proof 
of  Theorem  3.1  that  even  if  the  optimal  transmit  power  levels 
are  known  (i.e.,  || || 2  <  T),  is  replaced  with  || || 2  =  P^), 
the  problem  of  finding  the  optimal  beamforming  directions  of 
harmonic  mean  rate  maximization  problem  is  still  NP-hard. 
Second,  we  have  set  the  noise  powers  of  users  4i  —  1,47  — 
2,  Ai  —  3,  i  =  1,  2, ...,  n,  to  zero  in  (7).  These  settings  simplify 
the  proof  and  do  not  reduce  any  generality.  We  could  have 
used  small  noise  power  values  in  the  proof  (even  though 
some  extra  argument  is  needed),  since  there  is  a  positive 
gap  between  the  global  optimal  value  and  the  local  optimal 
values  of  (7).  Finally  ,  our  proof  actually  implies  that  there 
is  a  positive  probability  (measure)  that  a  randomly  generated 
MISO  coordinated  beamforming  problem  under  the  harmonic 
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mean  utility  is  NP-hard  to  solve.  In  particular,  by  continuity,  all 
slightly  perturbed  versions  of  the  constructed  instance  (7)  (i.e., 
channel  vectors,  noise/transmit  powers  are  slightly  changed) 
will  be  equivalent  to  the  MAX-2UNANIMITY  problem.  This 
is  because  there  is  a  positive  (and  constant)  jump  in  the  global 
optimal  value  of  the  constructed  example  when  the  optimal 
value  of  the  corresponding  MAX-2UNANIMITY  problem 
increases  by  one.  When  channel  conditions  change  slightly, 
this  one-to-one  correspondence  between  the  optimal  values  of 
the  two  problems  and  the  property  of  the  discrete  jump  in  the 
optimal  value  of  the  constructed  MISO  problem  remain  valid. 


(Lemma  3.1),  it  follows  that  the  optimal  coordinated  downlink 
beamforming  problem  with  utility  H 2  is  also  NP-hard.  ■ 

E.  Maximization  of  Min-Rate  Utility 

Let  the  system  utility  function  be  given  by  II  -  II\ .  In 
this  case,  the  problem  can  be  solved  in  polynomial  time  for 
arbitrary  L  and  K  [1],  [2],  Specifically,  letting 

r  =  min  { rk }, 

1  <k<K 

the  min-rate  utility  maximization  problem  becomes 


D.  Maximization  of  Proportional  Fairness  Utility 


Like  the  harmonic  mean  utility,  we  have  the  following 
hardness  result. 

Theorem  3.2  (Proportional  Fairness  Utility):  For  the  pro- 

/  K  \  1/K 

portional  fairness  utility  H2  =  (  rk  )  ,  the  optimal  co- 

Vfc=i  / 

ordinated  downlink  beamforming  problem  can  be  transformed 
into  a  convex  optimization  problem  when  L  =  1,  but  is  NP- 
hard  when  L  >2. 

Proof:  The  first  part  of  Theorem  3.2  is  proved  in  [1].  For 
the  second  part,  the  argument  is  similar  to  that  of  Theorem  3.1. 
We  only  give  a  proof  outline  below. 

First,  we  have  the  following  lemma  whose  proof  is  provided 
in  Appendix  II. 

Lemma  3.3:  The  function  f(x)  =  log  log  (l  + 


is  strictly  convex  in  x  >  0  for  any  a. 

Second,  given  any  MAX-2UNANIMITY  problem,  an  in¬ 
stance  (10)  of  the  optimal  coordinated  downlink  beamforming 
problem  (3)  with  utility  log  //2  (equivalent  to  proportional 
fairness  utility  maximization)  and  3 n  +  2 m  users  is  con¬ 
structed  as  follows.  Notice  that  each  global  optimal  solution 
of  (10)  must  have  Vg^  =  v3i_2  =  (1,0)T,  i  =  1,2 
and  v3n+2j  =  Vgn+2j_1  =  (1,  0)T,  j  =  1, 2, ...,  to.  Moreover, 
we  consider  the  following  parametric  optimization  problem 
(similar  to  (19)  in  the  harmonic  mean  case): 


max  log  r3  +  log  r2  +  log  r  1 

s.t.  r3  =  log  (l  +  Kv^,  V0T)v3|2V 
r2  =  log  (1  +  I/O2  +  |(1, 0) v3 1  )) , 
fi  =  log  (l  +  I/O2  +  1(0)  l)v3|2))  5 
llxtll  =  t, 


where  a  >  0  is  a  constant  and  t  is  a  parameter.  The  global 
maxima  of  (11)  should  be  (f,  0)T  and  (0,f)T  when  a  is 
small.  Furthermore,  the  optimum  value  of  (1 1)  is  an  increasing 
function  with  respect  to  t  £  [0, 1].  Using  an  argument  similar 
to  that  of  the  harmonic  mean  case  for  (8),  each  globally 
optimal  beamforming  solution  v3l  of  (10)  should  be  either 
ha  or  h/,  when  N  >  3(e6m  —  1).  When  restricted  to  solutions 
of  the  form  v3i  =  ha  or  hf,,  the  maximum  of  (10)  is  only 
linearly  related  to  the  maximum  number  of  unanimous  clauses 
in  the  given  MAX-2UNANIMITY  problem.  Thus,  maximizing 
the  number  of  unanimous  clauses  is  the  same  as  solving 
(10).  Since  MAX-2UNANIMITY  problem  is  NP-complete 


max 

s.t. 


r 

r  <  log 


(12) 


II vfc||2  <  Pfc,  1  <  k  <  K. 

Given  a  r  >  0,  we  can  efficiently  check  if  there  exists 
Vfe,fc  =  1,2,...,  K,  such  that  the  constraints  in  (12)  are 
satisfied.  This  feasibility  problem  is  a  second  order  cone 
programming,  which  can  be  solved  efficiently  using  interior- 
point  methods.  The  following  theorem  is  a  generalization  of 
the  result  of  [2],  which  deals  with  the  single-cell  case. 

Theorem  3.3  (Min-Rate  Utility):  For  the  min-rate  utility, 
the  optimal  coordinated  downlink  beamforming  problem  can 
be  solved  in  polynomial  time  with  arbitrary  K  and  L. 

Proof:  We  give  the  following  bisection  algorithm  for 
solving  (12). 


A  Polynomial  Time  Algorithm  for  Min-Rate 
Utility  Maximization 

Step  1.  Initialization:  Choose  r/:  and  ru  such  that  the 
optimal  ropt  lies  in  [re,ru\  and  a  tolerance  e. 

Step  2.  If  ru  —  re  <  e,  stop,  else  go  to  Step  3. 

Step  3.  Let  rm;d  =  (re  +  ru)/ 2  and  solve  an  SOCP 
problem  to  check  the  feasibility  problem  of  (12) 
with  r  =  rmid.  If  feasible,  let  77  =  rm;d,  else  set 
ru  =  An ki  and  go  to  Step  2. 

According  to  standard  analysis  of  path-following  interior- 
point  methods.  Step  3  can  be  finished  in  0(K35L3-5)  time. 
With  regards  to  initial  choices  for  r e  and  ru,  we  can  let  v*,  = 
h/cfc  V^c/llhfcfcll  (space  matched  beamformer)  and 


rt  =  min  log 
k 


1  + 


lhLvfc|2 

Ej¥fclh^l2, 


ru 


min  log 
k 


It  takes  log2  ((rM  —  re)  / e)  iterations  to  reach  tolerance 
e.  Thus,  a  total  of  0(K3-5L3  5  log2  (( ru  —  re)  /e))  arithmetic 
operations  are  needed  in  the  worst  case.  Since 


ru~re<  log(l  +  Pfc||hfcfc||2/(j2)  <  Pfe||hfcfc||2/cr2, 


the  second  equation  in  the  next  page  holds  true,  which  is  a 
polynomial  in  the  length  of  input  data  (all  the  channel  vectors 
h kj,  noise  power  cr2  and  power  upper  bound  If  ).  Thus,  the 
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max  E  (log?’3i  +  logr3l_!  +  logr3i_2)  +  ^  (logr3n+2j  +  logr3„+2,,_i) 
*= i  j-i' 

s.t.  r3j  =  log  ^1  +  | (v^oTT,  v/03)v3,;|2^  , 


r$i- i  =  log  ^1  + 
r3l_2  =  log  ( 1  + 


|(l,0)v3i_i 
1(1) 0)v3*|2  )' 
i(i)Q)v3,-2r 

1(0)  l)v3i|2 


r3n+2j  =  log  1  + 


|htv3ll  ■  -  -|2 


3n+2j  | 


1  +  lh 


(j)V4kO')l  I 


r(j)V4|r(j)|  I 


r3n+2j-i  =  log  1  + 


|h^v3. 


n+2j—l  | 


1  +  lh-w(i)V4k(j)||2  +  lh-r(,)V4|r(i)||2, 


||v/e||2  <  1,  1  <  i  <  n,  l<j<m,  l<k<3n-\-2m. 


(10) 


K35L35  log2  ((ru  -  r*)  /e)  <  K3  5L 3  5  (|log2Pfc|  +  2 1  log2  ||hfcfc|||  +2|log2  crfc|  +  |log2e|) , 


above  algorithm  has  a  polynomial  time  worst  case  complexity. 

■ 

Remark:  The  algorithm  described  herein  easily  extend 
to  the  weighted  min-rate  maximization  problems.  In  [28], 
the  weighted  min-rate  problem  is  related  to  the  weighted 
sum  MSE  minimization  problem  and  the  weighted  sum-rate 
maximization  problems  through  the  nonnegative  matrix  theory. 

Table  II  summarizes  the  complexity  status  of  the  optimal 
coordinated  downlink  beamforming  problem  (3)  for  different 
choices  of  utilities. 


IV.  A  Cyclic  Descent  Algorithm 


In  this  section,  we  consider  numerical  algorithms  for  the 
coordinated  beamforming  problem  (3)  with  harmonic  mean 
utility  i?3: 


min  p(v1,v2,,..,vK) 

s.t.  II Vfc||2  <  Pk,  1  <  k  <  K, 


(13) 


where 


K 

p(v  l,V2,...,Vif)  =  ^2 
fc= 1 


1 

) 

Tk 


rk  =  log 


IhLvfcl2  \ 
Ofc+Ej#fcihJfcvji2y 


Since  this  problem  is  NP-hard  (proved  in  Section  III),  we 
are  led  to  develop  efficient  algorithms  to  find  a  high  quality 
approximate  solution  or  a  stationary  point  for  (13).  Due  to 
variable  separability  in  the  constraints  of  (13)  and  our  desire 
for  distributed  implementation,  we  propose  to  solve  (13)  by 
cyclicly  adjusting  the  beamforming  vector  vk  while  assuming 
the  beamforming  vectors  {vj  :  j  ^  k)  are  fixed.  In  other 
words,  we  solve  a  sequence  of  per-base  station  problems 


min  p(v1,  v2, ...,  vK) 

Vfc 


s-t.  ||vfc||2  <  Pk. 


Interestingly,  the  pricing  algorithm  introduced  in  [17]  can  be 
viewed  as  a  partially  linearized  version  of  our  cyclic  coordinate 
descent  algorithm.  In  particular,  our  proposed  algorithm  tries 
to  allocate  resources  of  the  fc-th  transmitter  by  maximizing 
the  summation  of  all  users’  utility  functions  subject  to  its 
power  constraint  (||vfc||2  <  Pk)\  while  the  pricing  algorithm 
lets  transmitter  k  maximize  its  own  utility  function  plus  the 
summation  of  the  first  order  approximation  of  all  other  users’ 
utility  functions  at  the  current  point  Ikj  =  hj^Vfcl2,  where 
Ikj  denotes  the  interference  at  the  y-th  receiver  from  the  fc-th 
transmitter. 


A.  An  Inexact  Cyclic  Coordinate  Descent  Algorithm 

The  cyclic  coordinate  descent  algorithm  is  also  known  as  the 
nonlinear  Gauss-Seidel  iteration  [32].  There  are  several  studies 
of  this  type  of  algorithms  [32-36]  with  many  applications 
in  engineering  [37].  However,  most  of  these  studies  require 
either  the  convexity  of  objective  function  or  exact  solution  of 
subproblems  (14),  which  not  only  is  costly  but  also  may  result 
in  algorithm  divergence  [33].  Below  we  consider  a  general 
differentiable  optimization  problem  with  separable  constraints 

min  /(xbx2,...,xK) 

s.t.  xfe  e  Xk,  l<k<K,  U  j 

where  the  feasible  set  X  :=  JllLi  Xk  is  separable,  bounded 
and  closed.  We  propose  an  easily  implementable  cyclic  co¬ 
ordinate  descent  algorithm  which  simply  requires  a  sufficient 
decrease  in  the  objective  of  (14)  at  each  iteration.  The  algo¬ 
rithm  can  be  applied  to  solve  the  utility  maximization  problem 
(3)  with  II  =  //  | ,  //2  and  fT3  and  have  the  same  convergence 
properties  because  they  have  smooth  objective  functions  and  a 
separable  feasible  region.  But  the  same  can  not  be  said  about 
H]  since  it  is  non-differentiable. 


(14) 


TABLE  II 

Complexity  status  of  the  optimal  coordinated  downlink  beamforming  problem 
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Utility 

Class 

Weighted  Sum-Rate 

Proportional  Fairness 

Harmonic  Mean 

Min-Rate 

L  =  1,  any  K 

NP-hard  [1] 

Convex  [1] 

Convex  [1] 

Poly,  time  Algorithm  [1],  [2] 

L  >  2,  any  I\ 

NP-hard 

NP-hard 

NP-hard 

Poly,  time  Algorithm  [1],  [2] 

An  Inexact  Cyclic  Coordinate  Descent  Algorithm 
Step  1.  Initialization:  choose  x1  =  [x-j ,  x), ....  and  a 
tolerance  e  >  0. 

Step  2.  Iteration  i  >  1:  Denote  for  k  =  1,2, ...,  K, 


»*+i 


=  (xl+1>- 


xi+1  x*+1  X 


fc+1  5  J  5 


(16) 


and  let  z 


*+i  _ 


o 


=  x 


For  k  =  1,  2, K, 

-  Compute  the  gradient  projection  direction 
for  the  component  xfc  according  to 

d^1  =  Pxk  [xj,  —  VXfc  / (z^t,\ )]  —  Xj, ,  (17) 

where  Pxk  [■]  denotes  the  orthogonal  projec¬ 
tion  to  Afc. 

-  Determine  a  stepsize  ajf1-1  using  the  back¬ 
tracking  line  search  technique  [38], 

-  Update  x^+1  =  x*fc  +  ai+1dl+1. 


Let  xi+1  =  z‘+\ 

Step  3.  Termination:  If  ||xl+1  —  x*||  <  e,  then  stop.  Else, 
set  i  =  i  +  1  go  to  Step  2. 


When  specialized  to  the  MISO  downlink  beamforming 
problem,  the  above  inexact  cyclic  optimization  procedure  can 
be  implemented  in  a  distributed  fashion.  At  the  initial  step, 
each  base  station  needs  to  know  the  local  CSI  for  all  channels 
originating  from  that  transmitter  (either  through  feedback  or 
reverse-link  estimation  [31]).  The  only  information  to  be 
exchanged  are  the  SINR  terms  at  K  receivers.  In  subsequent 
iterations,  a  base  station  updates  its  beamforming  vector  by 
solving  (14)  inexactly  using  a  gradient  projection  algorithm. 
After  that,  all  receivers  measure  individual  SINR  terms  and 
send  the  SINR  information  to  the  next  base  station.  The  inexact 
cyclic  coordinate  descent  algorithm  enables  each  transmitter 
to  update  its  beamformer  with  only  limited  information  ex¬ 
change. 

The  next  result  shows  that  the  above  Inexact  Cyclic  Coor¬ 
dinate  Descent  Algorithm  converges  to  a  KKT  point  of  (15). 
The  proof  of  this  result  is  relegated  to  Appendix  III. 

Theorem  4.1:  Suppose  /(x)  is  twice  continuously  differ¬ 
entiable  and  bounded  below,  and  the  feasible  set  X  := 
J|*=1  Xk  is  convex,  separable  and  compact.  Then  every 
accumulation  point  of  the  sequence  { x' }  generated  by  the 
inexact  cyclic  coordinate  descent  algorithm  is  a  stationary 
point  of  (15). 

The  separability  of  constraints  is  necessary  for  the  algo¬ 
rithm’s  convergence.  The  following  example  (taken  from  [35]) 
shows  that,  without  the  separability,  the  algorithm  can  get 


stuck  at  an  uninteresting  point: 

min  x\  +  x\ 
s.t.  X\  +  X2  >  2. 


This  strongly  convex  function  has  a  unique  global  solution  at 
x*  =  xi,  =  1.  However,  if  the  initial  point  is  (1.5, 0.5),  the 
cyclic  coordinate  descent  algorithm  will  be  stuck. 

Specializing  the  inexact  cyclic  coordinate  descent  algorithm 
to  the  coordinated  beamforming  problem  (13),  we  need  to  per¬ 
form  a  projected  gradient  descent  iteration  for  the  subproblem 
(14).  In  this  case,  we  have  a  ball  constraint  14  =  {v  |  ||v||2  < 
Pfc}  and  the  corresponding  projection  is  straightforward 


pvk{v)  = 


V, 


if  |jv||2  <  Pfc, 
if  l|v||2  >  Pfc. 


(18) 


As  a  variant,  we  can  also  use  the  so  called  Barzilai-Borwein 
(BB)  projection  step  for  the  subproblem  (14)  to  replace  the 
standard  gradient  projection  step.  In  particular,  at  iteration  i, 
the  BB  gradient  projection  direction  dlBB  is  given  by 


f  dBB  =  pvk  (vj.  -  -  vj., 

{  i  _  lls*-1!!2 

{  BB  (gi-tjTyi-l’ 

where  s4”1  =  v*  -v*"1,  y4"1  =  VVjfcp(v‘)  -  V^v4"1).  It 

can  be  shown  that  the  above  BB  gradient  projection  direction 
is  always  a  descent  direction.  The  P-linear  convergence  of  the 
BB  method  has  been  established  for  strongly  convex  quadratic 
functions  in  [39], 

If  we  use  a  single  gradient  projection  step  to  inexactly  solve 
the  partially  linearized  subproblems  in  the  pricing  algorithm 
[17],  then  the  resulting  inexact  pricing  algorithm  is  algorith¬ 
mically  identical  to  the  inexact  cyclic  coordinate  descent  algo¬ 
rithm  considered  herein.  This  is  because  the  gradient  vectors 
of  the  two  utility  functions  in  the  respective  subproblems  (i.e., 
p(v)  and  its  partially  linearized  versions)  are  exactly  the  same. 
This  observation,  coupled  with  Theorem  4.1,  immediately 
implies  the  convergence  (to  a  KKT  solution)  of  the  inexact 
pricing  algorithm  for  MISO  interference  channel.  The  latter 
is  interesting  since  the  convergence  of  the  original  pricing 
algorithm  has  not  yet  been  established. 


V.  Numerical  Simulations 

To  evaluate  the  effectiveness  of  the  cyclic  descent  algorithm, 
we  have  conducted  numerical  simulations  for  a  7-cell  network 
with  one  user  per  cell  as  shown  in  Fig.  1.  Each  base  station 
is  equipped  with  L  antennas.  Similar  to  [5],  standard  WiMax 
parameters  are  used  in  all  the  simulations;  see  Table  III,  where 
d  is  the  distance  in  kilometers.  The  location  of  each  remote 
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TABLE  III 

Standard  WiMax  Parameters 


Model  or  Parameters 

Values 

noise  power  spectral  density 

—  162  dBm/Hz 

path  loss  model 

128.1  +  37.6  log10(d) 

log-normal  shadowing 

8  dB 

distance  between  neighboring  base  stations 

2.8km 

antenna  gain 

15  dBi 

Fig.  1.  A  wireless  network  with  seven  base  stations  and  single  user  per  cell. 


user  is  chosen  randomly  within  its  cell  such  that  it  is  at  least 
0.5km  away  from  the  corresponding  base  station. 

Figure  2  plots  the  iteration  process  of  BB  projection  method 
for  the  coordinated  downlink  beamforming  problem  with 
harmonic  mean  utility.  It  can  be  seen  that  most  of  improvement 
is  achieved  in  the  first  1-2  iterations,  making  the  method 
attractive  for  practical  implementations. 

For  a  two-user  MISO  channel,  a  parametrization  of  the 
achievable  rate  region  boundary  was  given  in  [8],  [24],  We 
can  use  this  parametrization  to  compute  the  global  optimal 
solution  of  the  coordinated  MISO  downlink  beamforming 
problem  by  searching  along  the  rate  region  boundary.  In  Fig.  3, 
the  performance  of  our  proposed  cyclic  descent  algorithm 
is  compared  against  the  global  optimum  for  50  randomly 
generated  two-user  MISO  channel.  It  can  be  seen  that  the 
proposed  algorithm  either  achieves,  or  nearly  achieves,  the 
global  optimality. 

Figure  4  shows  the  performance  achieved  by  the  cyclic 
coordinate  descent  algorithm  versus  different  number  of  an¬ 
tennas  for  various  utility  functions  and  with  a  fixed  transmit 
power  P  =  30  dBm  at  each  base  station.  Each  point  in 
Fig.  4  is  obtained  by  averaging  over  500  independent  channel 
realizations.  Space  matched  beamformers  are  used  as  the 
benchmark.  It  can  be  seen  that  the  transmit  rates  improve 
significantly  over  the  benchmark  solution.  When  the  number 
of  users  ( K  =  7)  and  the  transmit  power  ( P  =  30  dBm)  are 
fixed,  the  utility  increases  linearly  with  the  number  of  antennas 
L,  suggesting  an  additive  system  gain  with  increasing  L. 


Fig.  2.  A  typical  iteration  process  of  the  cyclic  coordinate  descent  algorithm 
with  K  =  7,  L  =  4  and  P  =  30  dBm. 


Fig.  3.  Performance  comparison  of  the  proposed  algorithm  and  the 
parametrization  method  with  K  =  2,  P  =  30  dBm. 

VI.  Conclusion 

Coordinated  transmit  beamforming  is  a  promising  approach 
for  interference  mitigation  in  a  MISO  interference  channel.  A 
major  design  challenge  is  to  find,  for  a  given  channel  state, 
a  globally  optimal  beamforming  strategy  under  an  appropriate 
utility  criterion.  In  the  single  carrier  case  with  a  single  antenna 
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Benchmark  for  Min-Rate  Utility 
Min-Rate  Utility 

Benchmark  for  Harmonic  Mean  Utility 
Harmonic  Mean  Utility 
Benchmark  for  Proportional  Fairness  Utility  :■ 
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4  5 

Number  of  Antennas 


Fig.  4.  Performance  comparison  of  coordinated  beamformers  with  space 
matched  beamformers  with  K  =  7,  P  =  30  dBm. 


per  transmitter,  maximizing  the  (weighted)  sum-rate  is  known 
to  be  NP-hard.  However,  the  same  problem  is  polynomial  time 
solvable  when  the  proportional  fairness,  harmonic  mean  or 
max-min  utility  is  used.  It  turns  out  the  situation  with  multiple 
transmit  antennas  is  rather  different.  For  instance,  when  each 
transmitter  (e.g.,  a  base  station)  is  equipped  with  two  antennas, 
the  corresponding  joint  beamformer  design  problem  becomes 
NP-hard  under  either  the  proportional  fairness  or  the  harmonic 
mean  criterion.  These  complexity  results  suggest  that  we 
should  abandon  effort  to  find  globally  optimal  beamformers 
for  a  general  MISO  interference  channel  unless  the  max-min 
utility  is  used.  In  the  latter  case  the  problem  remains  solvable 
in  polynomial  time. 

Motivated  by  these  complexity  results,  we  propose  a  simple 
distributed  inexact  cyclic  coordinate  descent  algorithm  to 
find  a  (locally  optimal)  beamforming  strategy.  Our  algorithm 
exploits  the  separable  structure  of  the  power  constraints,  and 
is  provably  globally  convergent  to  a  KKT  solution.  This 
algorithm  requires  only  local  CSI  and  an  exchange  of  local 
SINRs  at  each  iteration.  Numerical  experiments  with  WiMax 
system  parameters  show  that  the  proposed  algorithm  is  both 
effective  and  efficient,  providing  significant  rate  gain  over  the 
space  matched  beamforming  strategy. 
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Appendix  I 
Proof  of  Lemma  3.2 

The  proof  consists  of  establishing  three  claims. 

Claim  1:  The  function  log-1  ^1  +  (cr2  +  x)  ^  is  strictly 
concave  in  x  >  0  for  any  a  f  0.  Furthermore,  (1,0)T 


and  (0,1)T  are  the  only  global  minima  for  the  optimization 
problem 

min  log^1  ^1  +  N  N)+log~1[l+  N  \ 

V  <J  +  x )  \  cr2  +y ) 

s.t.  x  +  y  =  1,  x  >  0,  y  >  0, 

where  TV  >  0. 

To  establish  Claim  1,  we  first  show  the  strict  concavity  of 
r(x)  =  log-1  ^1  +  (cr2  +  x)  ^  .  Since 

/(x)=  (I  +  ^  +  Ih^  +  x)’ 

=  2r2(j)  (r( x)  -  (l/2  +  a2+x)) 

(1  +  a2  +  x)2  (cr2  +  x)2 

it  follows  that 

r"{  x)  <  0  •<=>■ 

g( x)  =  log  (1  +  1  / (cr2  +  x ))  -  l/(l/2  +  cr2  +  x)  >0. 

Let  z  =  l/{a2  +x)  and  consider  h{z)  =  log(l  +  z)  — 
22/(2  +  z).  Since 

M0)  =  0  and  h'{z)  =  ^1+^z  +  2y  >  °>  V  ~  >  °> 

it  follows  that  g(x)  =  h(z)  >  0  for  all  x  >  0,  implying 
the  strict  concavity  of  r(x)  over  the  interval  (0,oo).  Since 
affine  transformation  does  not  change  strict  concavity  of  a 
function,  this  implies  that  log-1  (l  +  N/(cr2  +  x))  is  also 
strictly  concave  for  x  >  0.  Finally,  since  the  minimum  of 
a  strictly  concave  function  over  a  polytope  is  always  attained 
at  a  vertex  [40],  we  have  established  the  claim. 

To  establish  Lemma  3.2,  let  us  consider  the  following 
parametric  optimization  problem  in  R2 

1111 

min - 1 - 1 - 1 - 

r4  r3  r2  n 

s.t.  r4  =  log  (l  +  0.9|(1, 1) v | 2)  , 

r3  =  log  (1  + 1/|(1,  l)v|2),  (19) 

r2  =  log  1  +  100/1(1, 0)v|2), 

r4  =  log  (1  +  100/|(0,  l)v|2)  , 

IMI  =  t, 

where  v  =  (vi,v2)t,  and  t  >  0  is  a  parameter. 

Claim  2:  Let  f(t)  denote  the  minimum  value  of  (19).  The 
following  properties  hold: 

1)  f(t )  is  a  strictly  decreasing  function  in  [0, 1], 

2)  When  t  £  (0.95, 1],  the  global  minima  of  (19)  are  (t.  0)T 
and  (0,t)T, 

3)  fit)  is  an  increasing  function  in  [0.95,1]. 

Let  us  argue  that  Claim  2  is  true.  First,  if  part  2)  is  true, 
then  we  can  check  part  3)  directly  by  first  computing  /(f)  as 
the  objective  value  of  (19)  at  its  optimal  solutions  (f,  0)T  or 
(0,f)T,  and  then  verifying  that  /"(f)  >  0  for  f  e  [0.95,1]. 
We  omit  the  details  of  computation  for  space  reason.  So  we 
only  need  to  argue  parts  1)  and  2). 
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When  £  is  fixed,  the  KKT  condition  for  problem  (19)  can 
be  written  as 


4>{v  i,v2) +  4>(vi)  =  Xvi, 

4>(v!,v2)  +  ip(v2)  =  Xv2, 


(20) 


vl 


■vl 


=  £2, 


where  (f>(x,  y)  and  ip(x)  are  given  above  and  A  is  the  La- 
grangian  multiplier  associated  with  the  constraint  ||v||  =  f  in 
\/2 

(19).  It  can  be  seen  that  ^  (f,  t)T  is  always  a  solution  to  (20). 

Moreover,  (£,  0)T  and  (0 ,  f)T  are  the  (only)  non-differentiable 
points  of  (19).  Suppose  (vi(t),v2(t))T ,  where  vi(t)  f  v2(t), 
are  other  KKT  solutions  to  (20)  (if  there  are  any).  Notice  that 
the  global  minimum  /(£)  is  always  attained  at  a  KKT  point 
or  at  a  non-differentiable  point  of  (19).  Thus,  we  must  have 

f(t )  =  .min  {fA(t),fB(t),fc(t)}, 

{A,B,C} 

where  /U(t)  and  fsif)  are  given  in  the  following  page  and 
fc(t)  is  the  objective  value  of  (19)  at  (vi(t),  v2{i))T . 

Next,  we  claim  that  /(£)  is  a  decreasing  function  in  £  £ 
[0,1].  We  prove  this  by  examining  the  monotonicity  of  the 
component  functions  /a (t) ,  /b(£)  and  By  (20),  we 

have 

=  fjvft))  ~f{v2(i)) 

Vi  (t)-v2(t) 

where  the  last  step  follows  from  the  fact  that  ifi(x)  is  a  de¬ 
creasing  function  for  x  £  [0,1],  Therefore,  we  conclude  from 
the  standard  sensitivity  analysis  [38]  of  duality  multipliers  that 
fc(t)  decreases  as  £  increases  in  [0,1].  For  t  £  [0,0.75],  it 
can  be  checked  that  all  of  /,;(£),£  =  A,B,C,  are  decreasing 
functions.  Thus,  /(f)  decreases  monotonically  in  [0,0.75]. 
For  t  £  (0.75,0.85),  the  global  minimizer  of  (19)  is  neither 
yj  2 

-^-(£,£)7  nor  (f,  0)T  or  (0,f)T  as  we  always  can  find  a 
point  at  which  the  objective  function  is  smaller  than  /4(f)  and 
/b(£).  As  a  result,  /(f)  =  fc(t)  decreases  in  £  £  (0.75,0.85). 
For  £  £  [0.85,1],  we  have  know  /s(f)  <  /a(£),  so  /(f)  = 
min {/b(£),  fc(t)}-  As  both  of  /#(£)  and  /c(£)  are  decreasing 
functions  in  t  £  [0.85, 1],  we  know  /(f)  decreases  in  [0.85, 1], 
This  completes  the  proof  that  /(f)  is  monotonically  decreasing 
in  [0,1]. 

We  next  prove  part  2)  of  Claim  2:  namely  for  f  £  [0.95, 1], 
the  minima  of  (19)  are  (£,  0)T  and  (0,f)T.  We  can  use 
v2  =  \Jt 2  —  v\  to  transform  (19)  into  an  unconstrained 
univariate  optimization  problem,  where  —  f  <  v\  <  t. 
The  global  minimizer  (vi,v2)t  of  (19)  should  satisfy 
that  v±v2  >  0,  else  we  can  find  (v\,v2)T  such  that 

V\  +  v2  =  v\  +  v2,  | t;i|  <  |ui|  and  |u2|  <  \v2\1  at  which 
objective  would  be  lower.  Thus,  we  only  need  to  consider 
the  case  V\V2  >  0.  Due  to  the  symmetry  of  (19),  we  only 
consider  the  case  0  <  V\  <  \/2t/2.  It  can  be  checked  that  for 
any  f  £  [0.95,1],  the  objective  function  of  (19)  increases  as 
V\  increases  in  [0,  v/2f/2].  Hence,  (0,f)T  and  (f,0)T  are  the 
global  minima  for  (19)  with  t  £  [0.95, 1]. 


Claim  3:  When  N  >  2(e40m  —  1),  the  global  minima  vj,  of 
(8)  must  have  unit-norm  || ||  =  l,i  =  1,2,  ...,n. 


By  symmetry,  we  only  need  to  prove  ||v|||  =  1.  Let  us 
consider  the  following  parametric  optimization  problem  (21) 
in  V4  with  the  other  variables  V4.J,  i  =  2,3,  ...,n,  fixed,  and  f 
as  a  parameter.  Let  g(t)  be  the  optimum  value  of  (21)  and  let 
<?i(£)i  32(f)  denote  respectively  the  values  of 


- 1 - 1 - 1 - and 

r  1  r2  r3  r4 


1 

- h 

^  4n+2j 


1 


^471+2^  —  1 


when  evaluated  at  the  global  minima  of  (21).  By  definition, 
/(f)  <  3i(f)  f°r  all  f  £  [0,1].  We  claim  that  /(f)  =  31(f) 
for  f  £  [0.95,1].  To  see  this,  we  consider  the  optimization 
problem  (22).  Define  |h]lV4|2  =  x,  then  | V4 12  =  t2  —  x,  and 
the  objective  function  of  (22)  is  only  dependent  on  x,  while 
its  constraint  || V4 1|  =  f  can  be  transformed  to  0  <  x  <  f2. 
Using  Claim  1,  we  see  that  the  objective  function  of  (22)  is 
strictly  concave  in  x.  As  a  result,  the  globally  optimal  x  can 
only  be  attained  at  the  end  points  of  the  constraint  interval 
[0,f2].  In  other  words,  the  global  minimizer  of  (22)  must  be 
either  (f,  0)T  or  (0,f)T.  Since  the  global  minima  of  (19)  are 
also  (f,  0)T  and  (0,f)T  when  f  £  [0.95,1],  it  follows  that 
the  global  minimizer  of  (21)  is  either  (f,  0)T  or  (0,f)T,  if 
f  £  [0.95, 1].  This  implies  that  /(f)  =  31(f)  in  [0.95, 1]. 

Next,  we  prove  that  if  N  >  2(e40m  —  1),  then  g(l)  <  g(t) 
for  t  £  [0, 1).  Due  to  the  choice  of  N,  it  can  be  checked  that 


/'(!)  + 


2  TO 

log2(l  +  N/  3) 


<0, 


/( 0.95)  >  /( 1)  - 

Then,  we  have 

1)  For  any  f  €  [0.95, 1], 

3'(f)  =  3i (f)  ‘ 

<  /'(l) 

<  /'(l) 


2m 


log(l  +  N/2) ' 


9^)  =  f(t) 

-m 

2m 

'  log2(l  +  JV/3) 


32(f) 


<  0. 


The  second  equality  is  due  to  the  fact  /(f)  =  31(f)  for 
f  £  [0.95, 1],  The  first  inequality  holds  since  f(t)  is  an 
increasing  function  in  [0.95, 1]  (cf.  part  3)  of  Claim  2), 
while  the  second  inequality  follows  from 

2  777, 

32(f)  <  2  /  -]  .  1\T  ZO\  ’  fOT  aH  1  G  I0'95’  11- 

log  (1  +  A/3) 

The  non-positiveness  of  g'(t)  over  f  £  [0.95,1]  implies 
3(f)  >  3(1)’  for  f  e  [0.95,1). 

2)  For  any  t  £  [0,0.95],  we  have 


3(f)  >  3i(f)  >  /(f) 

>  /(0.95)  >  /(l)  +  ,og(1 

>  3i(l)  +  32(f)  =  3(1), 


2m 


N/2) 


where  the  third  inequality  holds  because  /(f)  is  a  strictly 
decreasing  function  in  [0, 1]  (cf.  part  1)  of  Claim  2)  and 
the  last  inequality  is  due  to 

2777, 

log(l  +  JV/2)  2  92<1).  f0r  8,1  *  e  |0- 1]^ 
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<i>(x,y) 

ip(x) 


_ -1-8  ( x  +  y) _ ^ _ 2 _ 

log2  (l  +  0.9  (x  +  y)2J  (l  +  0.9  (x  +  y)2J  log2  (l  +  l/(x  +  yfj  ((x  +  y)2  +  l)  (x  +  y) 
200 

log2  (1  +  100/a:2)  (100  +  a:2)  x' 


j  fA(t) 

{  fB(t ) 


1  ,  1  ,  2 

log  (1  +  1.8£2)  +  log  (1  +  0.5/t2)  +  log  (1  +  200 /t2)’ 

log  (1  +  0.9 12)  +  log(l  +  l/£2)  +  log  (1  +  100  A2)  ’ 


(1 

min 

— 

v4 

\r4 

s.t. 

r4  = 

T3  = 

T2  = 

n  = 

1  1  1  1\  ^ 

- 1 - 1 - 1 - I-!-/ 

r4  r 3  r2  n  J  2— ' 


(  1 


a'=i 

I2), 

E 

^4|2) 

,  |2\ 


1  +  4 1|  = 

T4n+2j  =  log 


1  +  N/  1+  |h 


\f'4n-\-2j  T'4n-\-2j  —  l 


U)V*\*U)\ I 


^4n-\-2j  —  l  =  log  (l  +  N/(l  +  |ht_7r(j)v4|7r0-)l 
1  <  j  <  to. 


l^(a)V4lT(j)| 
\2  +  \hf 


(j)VMr(j)\ I 


(21) 


min 

v4 

s.t. 


E 


f  i 


An-\-2j  ^4n+2j  — 1 


llv4||  =  t, 


^4n+2j  —  log  (l  +  N/(l  +  |h+(j.)V|47r(j-)||2  +  |h^.)V|4rW)||2)):, 
rin+2j-i  =  log  (l  +  N/(l  +  |ht_7r(j)v4|7r0-)||2  +  |ht_T0-)v4|r(i)| 


1  <  j  <  TO. 


(22) 


Combining  the  above  two  steps  shows  g(t)  >  5(1)  for  all 
t  £  [0,1).  It  follows  that  the  minimum  of  g(t)  over  [0,1]  is 
attained  at  t  =  1.  This  establishes  Claim  3. 

Finally,  notice  that  Claim  3  implies  that  problem  (8)  is 
equivalent  to  problem  (21)  with  t  =  1,  when  N  >  2(e40m  — 1). 
Since  the  global  optimal  solution  of  (21)  must  be  either  (0,  t)T 
or  (f,  0)T  for  t  £  [0.95, 1],  it  follows  that  the  optimal  solutions 
v4i,  i  =  1,2, ...,  K,  for  problem  (8)  should  be  either  (1, 0)T 
or  (0, 1)T  when  N  >  2(e40m  —  1). 


where  g{x)  =  (26+2a;+l)r(a:)  and  b  =  a2  >  0.  Let  y  =  a :+b, 
then  g(x)  becomes  h(y)  =  (2 y  +  1)  log(l  +  1  /y).  It  suffices 
to  prove  that  h(y)  >  1  for  all  y  >  0.  Since 


h’(v) 


2  log 


(2y  +  l) 
y(y  +  1) 


and 


h"(v) 


1 

y2{y  +  l)2 


>0, 


Appendix  II 
Proof  of  Lemma  3.3 

Proof:  Let  r(x)  =  log  (1  +  1/(6  +  x)),  then  we  have 
f(x)  =  logr(x),  and 


we  know  h'(y)  is  an  increasing  function.  Since 

liiny _ >+oc  h'(y)  =  0,  it  follows  that  h'(y)  <  0  and  h(y)  is  a 

decreasing  function.  Notice  that  limy _ ^  h(y)  =  2.  Thus, 

we  have  h(y)  >  2  >  1  for  all  y  >  0.  This  further  implies  that 
log  log  (1  +  1/(6  +  x))  is  strictly  convex  for  x  >  0.  ■ 


and 


r'(x) 


-1 

(6  +  1  +  x)(b  +  x)  ’ 


f'(x) 


r'(  x) 
r{x) 


/"(*) 


(y{x)  - 1) 

(b  +  1  +  x)2(b  +  x)2r2(x)  ’ 


Appendix  III 

Convergence  of  the  Cyclic  Coordinate  Descent 
Algorithm 

We  first  need  to  estimate  the  step  length. 
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Claim  1:  Suppose  c\  £  (0,1),  V/i(y*)Td*  <  0  and 

max  ||V2/i(y)||  <  B,  where  h( y)  is  a  multi-variable  function 
ye  Y 

associated  with  y  and  Y  is  the  feasible  region.  Consider  the 
Armijo  step  length  rule  [38]  whereby  ctj  :=  with  7  £  (0, 1) 
and  l  >  0  being  the  smallest  integer  satisfying 


h(yi  +  7^)  <  h(y)  +  c17£V/i(yi)Tdt. 

Then  we  have 

2(ci  -  l)V/i(yi)Tdi 


1  >  >  min  <  1, 


s7l|d 


t  II 2 


(23) 


(24) 


and 


Hyl)  -  My*  +  a*d*) 


>min|-ClVWV, 


f37||d*||2  j  • 

(25) 

Let  us  argue  Claim  1  holds.  Suppose  that  the  step  length 
cti  =  1  is  not  accepted.  In  this  case,  a,;  will  be  the  largest  step 
satisfying  the  sufficient  decrease  condition  (23),  implying 

h(yl  +  aa^d1)  >  h{ yl)  +  ciai7_1VIi(yi)Tdi.  (26) 


By  Taylor  expansion,  (27)  in  the  next  page  holds  true.  Com¬ 
bining  (26)  and  (27)  yields  (24).  Substituting  (24)  into  the 
sufficient  decrease  condition  (23),  we  immediately  obtain  (25), 
which  establishes  Claim  1. 

We  now  proceed  with  the  proof  of  Theorem  4.1.  The  basic 
idea  is  based  on  the  contradiction  principle.  More  exactly,  if 
there  is  no  convergence,  we  can  find  one  descent  direction 
which  can  provide  a  sufficient  descent  in  the  objective  function 
value  and  then  obtain  a  contradiction. 

At  first,  since  the  iterates  {x1}  lie  in  a  compact  set  X,  there 
must  exist  an  accumulation  point  for  { x' } .  Let  x  denote  an 
accumulation  point  such  that 

x  =  lim  x* 

££X 0  ,i—>  00 

for  some  subsequence  indexed  by  Iq.  Since  the  feasible  set 
X  is  closed,  x  must  also  be  feasible.  Furthermore,  since  the 
projection  mapping  and  the  function  /  are  both  continuous,  it 
follows  that 


lim  di+1  =  lim  PXl(xj-VXl/(z*))-xi 

— >00  l€-L  o,2— »00 

=  PXl  [xi  -  VXl/(x)]  -  xi  =  di 
and 

lim  /(x*)  =  /(x). 

fGl  q,i — >00 

Notice  that  the  function  values  {/(x*)}  are  decreasing  and 
bounded  below,  then  /(x*)  — »  /(x).  Since  /(x)  is  twice 
continuously  differentiable  and  the  feasible  set  X  is  bounded, 
it  follows  that 

B  = *.  ;n.ax  ^  ma?  nvxfc/(x)ii  <  00 • 

Because  the  projection  operator  is  non-expansive  and  V/(x) 
is  continuous  in  bounded  region,  we  obtain  from  (17)  that 

K+1||  =  \\PxM-^J«+-\)}-PxkK}\\ 

<  IK4  - vXfc/(z*t\)]  -4||  =  ||vXfc/(z*+_11)||. 


Denoting  maxx6x  ||V/(x)||  —  M,  hence  we  have 

IK+1||  <  ||VXfc/(z^11)||  <  M  <  +00.  (28) 

The  lower  bound  and  upper  bound  on  ||VXfc/(z]|.']l1)||  is  very 
helpful  in  estimating  the  decrease  value  of  the  function. 

We  proceed  by  contradiction  and  suppose  x  is  not  a  KKT 
point.  Then  d  =  Px[x  —  V/(x)]  —  x  ^  0  so  that  5  —  ||d||  >  0. 
Let 

k*  =  min {k  \  4  =  ||dfc||  >  0}, 

and  suppose  k*  >  1  without  loss  of  generality.  By  definition, 
we  have 

lim  ||d](+1||  =  4  =  0,  for  k  <  k* . 

££Xo,£— »oo 

Recall  the  definition  of  z^+1  (16).  Since 

IK+1  -x||  =  ||xj  +  ai+1di+1  -  Xi|| 

<  ||xi-x1||+ai+1||di+1|| 

<  ||xj -Xl|j  +  ||dl+1||  — >  0, 

l(zl.0,l—>OO 

we  have  limigx0,woo  zl+1  =  x-  In  general,  the  same  argument 
shows  that 


lim 


£6Xo,£— »oo 

Consequently,  there  holds 


zi+1  —  x 
zfe  — 


V  k  <  k*. 


lim  VXfc./(z  *+i1)  =  VXfc./(x) 


££Xq,£— »oo 


lim  dl.T  =  dk* 

£GXq,£— ^  00 


(29) 


Let  us  use  9k-  to  denote  the  angle  between  d/.«  and 
VXfc./(x).  Since  yfc«  =  xfc»  -  VXfc,/(x)  £  Xk, ,  it  follows 
from  the  property  of  projection  that 

(xfc.  -  PXk,  [yfc*])T  (n-  -  Pxk.  [yfc*])  <  0, 


which  further  implies 

\\Pxk,  [yfc*]  -_x/c*  ||2  _ 

<  (xfc*  -  Pxk,  [y/c*])T(xfc*  -  yfc*) 

=  ||-Px'fe,  [y*:*]  —  HU  vXfc» /(5c)  ||  cos  . 

Canceling  the  factor  \\Pxk,.  [yfc*]  —  Xfc*|  7^  0  from  both  sides 
yields 

4*  =  ||dfc*  ||  =  ||Px>  [yfc*]  -  xfc*  || 

<  ||VXfc,/(x)||  cos  6»fc. 

<  M  cos  9k * , 


where  the  last  inequality  follows  from  (28).  Thus,  we  have 


cos  9k*  > 


4* 

~M' 


(30) 


Now  we  can  use  the  property  (28),  (29)  and  (30)  to  conclude 


|VXfc,/(z*ti1)||>|K+1||>^  and 


4* 


cos  >  ^k* 
C0S  fc*  -  2 M’ 


for  all  i  £  To  and  i  >  z(l,  where  Iq  is  a  sufficiently  large  integer 
and  denotes  the  angle  between  VXfc,  /(z£ t^)  and  d^t1. 

Now  we  can  use  (23)-(25)  of  Claim  1  to  obtain  a  contradic¬ 
tion.  In  particular,  we  consider  (31),  which  is  a  contradiction. 
The  fourth  inequality  is  due  to  the  sufficient  decrease  condition 
and  the  fifth  inequality  is  due  to  Claim  1.  Therefore,  x  is  a 
stationary  point. 
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2  —2 

h(  r  +  =  nr) + aa-^nry  &  + 


<h{yl)  +  ^l~  ^h{d)  cT  + 


2  —2 

iT,i  ,  <+:  7 


Slid1 


(27) 


+oo  >  £  (/(x4)  -  /(x^1))  >  53  (/(x*)  -  /(xi+1)) 

i  fc£X0 

>  X!  (-c1a*t1VXfc./(zj+i1)rdj+1 

iGXo 

(vXfc. /(ztti1)rd’t1) 


*ex0 


=  E 


*ex0 


S7l|difct1||2 

^2Cl(l  -  Cl)  (||VXfc./(zt+l1)||||d»-t1||  cosgjt1)^ 
JB7||d*t1||2 

2ci(l  —  Ci)  (||VXfc,/(zl+i1)||cos6»*+1)2\ 


> 


Cl(l  —  cl)^fc* 

8BjM'2 


=  +00, 


(31) 
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